List of AI News about multimodal factuality in AI
| Time | Details |
|---|---|
|
2025-12-10 19:04 |
Gemini 3 Pro Leads AI Model Benchmark with 68.8%: Multimodal Factuality Remains a Challenge, According to Google DeepMind
According to @GoogleDeepMind, a comprehensive evaluation of 15 leading AI models showed Gemini 3 Pro achieving the highest score of 68.8%. The assessment highlighted that while search capabilities and internal knowledge have improved across models, the challenge of ensuring multimodal factuality persists industry-wide. Google DeepMind is sharing these benchmarking results on Kaggle to support the research community in developing more robust and reliable AI systems. This initiative aims to drive practical advancements in AI model reliability and accuracy for enterprise and research applications. (Source: @GoogleDeepMind, Dec 10, 2025, goo.gle/4aEUD4b) |